An Approach towards Applying and Constructing Multilingual Indo-WordNet
Authors
Abstract
In the work reported here, we present three important related issues. 1. We present an effective method for constructing the Marathi WordNet (http://www.cfilt.iitb.ac.in/wordnet/webmwn/) using the Hindi WordNet (http://www.cfilt.iitb.ac.in/wordnet/webhwn/), both of which are being developed at IIT Bombay. Henceforth we will refer to them as MWN and HWN respectively. 2. Synset identity is the key to connecting WordNets. 3. We present an interface for browsing the linked Hindi and Marathi WordNets (Bilingual WordNet) simultaneously for a given word in either Hindi or Marathi. As an application, we present Word Sense Disambiguation (WSD) of nouns in Hindi. The system has been evaluated on the corpora provided by the Central Institute of Indian Languages (http://www.ciil.org/), and the results are encouraging.
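The idea of linking two WordNets through a shared synset identity, and then browsing both for a word in either language, can be illustrated with a minimal sketch. This is not the authors' implementation: the synset IDs, the toy entries, and the names `build_index` and `lookup` are all assumptions made for illustration, and the data is not taken from HWN or MWN.

```python
from collections import defaultdict

# Hypothetical toy data: the IDs and members below are illustrative only.
# The same synset ID denotes the same concept in both WordNets.
HWN = {101: ["पानी", "जल"], 102: ["घर", "मकान"]}   # Hindi synsets
MWN = {101: ["पाणी", "जल"], 102: ["घर"]}           # Marathi synsets


def build_index(wordnet):
    """Map each word to the set of synset IDs it belongs to."""
    index = defaultdict(set)
    for synset_id, words in wordnet.items():
        for word in words:
            index[word].add(synset_id)
    return index


HINDI_INDEX = build_index(HWN)
MARATHI_INDEX = build_index(MWN)


def lookup(word):
    """Return the linked Hindi and Marathi synsets for a word in either language."""
    ids = HINDI_INDEX.get(word, set()) | MARATHI_INDEX.get(word, set())
    return {
        sid: {"hindi": HWN.get(sid, []), "marathi": MWN.get(sid, [])}
        for sid in sorted(ids)
    }


if __name__ == "__main__":
    # A Marathi query word retrieves the Hindi synset with the same ID.
    print(lookup("पाणी"))
```

Because the two dictionaries share keys, no separate alignment table is needed in this sketch; a word found in either index leads directly to the corresponding synset in the other language.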
Similar resources
An Approach towards Construction and Application of Multilingual Indo-WordNet
In the work reported here, we present three important related issues. 1. We present an effective method of construction of the Marathi WordNet (http://www.cfilt.iitb.ac.in/wordnet/webmwn/) using the Hindi WordNet (http://www.cfilt.iitb.ac.in/wordnet/webhwn/), both of which are being developed at IIT Bombay. Henceforth we will refer to them as MWN and HWN respectively. 2. The Synset identity i...
Expansion of the First Hindi-Nepali Word-Net Based Bi-Lingual Dictionary and the advancement of the Human-Machine Interface
Natural Language Processing is introducing a new era in the field of Computer Science and Machine translation. Human-Machine interaction is to play a very important role in the coming centuries as the dependency of human over the machine is increasing variably. Word-Net was first introduced by Miller and Fellbaum in 1985. WordNet is a Lexical database for the Human Languages. It groups the Human...
Wordnet creation and extension made simple: A multilingual lexicon-based approach using wiki resources
In this paper, we propose a simple methodology for building or extending wordnets using easily extractible lexical knowledge from Wiktionary and Wikipedia. This method relies on a large multilingual translation/synonym graph in many languages as well as synset-aligned wordnets. It guesses frequent and polysemous literals that are difficult to find using other methods by looking at back-translat...
Toward Constructing the National Cancer Institute Thesaurus Derived WordNet (ncitWN)
We describe preliminary work in the creation of the first specialized vocabulary to be integrated into the Open Multilingual Wordnet (OMW). The NCIt Derived WordNet (ncitWN) is based on the National Cancer Institute Thesaurus (NCIt), a controlled biomedical terminology that includes formal class restrictions and English definitions developed by groups of clinicians and terminologists. The ncitW...
IndoWordNet
India is a multilingual country where machine translation and cross lingual search are highly relevant problems. These problems require large resources, like wordnets and lexicons, of high quality and coverage. Wordnets are lexical structures composed of synsets and semantic relations. Synsets are sets of synonyms. They are linked by semantic relations like hypernymy (is-a), meronymy (part-of), tro...